Skew Detection and Correction Technique for Arabic Document Images Based on Centre of Gravity
Abstract
Problem statement: Skew detection and correction is the first step in document analysis and understanding. Correcting a skewed scanned document image is very important because it directly affects the reliability and efficiency of the segmentation and feature extraction stages. Noise and variation in document resolution or type remain the two main challenges facing Arabic skew detection and correction methods. Approach: The proposed method inscribes the text of the document in an arbitrary polygon and derives the baseline from the polygon's centroid. Results: The proposed method was tested on 150 scanned Arabic documents from different sources, such as journals, textbooks, and newspapers, as well as handwritten documents, with different resolutions and fonts, and achieved an accuracy of 87%. Conclusion: The proposed method was efficient, simple, and fast; it was not affected by noise, and it proved suitable for documents with different fonts and resolutions.
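The abstract does not spell out how the polygon is built or how the baseline is derived from its centroid, so the following is only a minimal sketch of a related, well-known centre-of-gravity idea, not the authors' procedure. It assumes the text is available as a list of (x, y) foreground pixel coordinates, computes their centroid, estimates the skew as the principal-axis angle from second-order central moments, and rotates the points about the centroid to undo it. All function names (`centroid`, `skew_angle_degrees`, `rotate_about`, `deskew`) are hypothetical, and the sign of the angle depends on whether the y axis points up or down.

```python
import math

def centroid(points):
    """Return the centre of gravity (mean x, mean y) of (x, y) points."""
    n = len(points)
    return (sum(x for x, _ in points) / n, sum(y for _, y in points) / n)

def skew_angle_degrees(points):
    """Estimate the dominant orientation from second-order central moments.

    Assumption: this moment-based estimate stands in for the paper's
    polygon-based baseline derivation, which the abstract does not detail.
    """
    cx, cy = centroid(points)
    mu20 = sum((x - cx) ** 2 for x, _ in points)
    mu02 = sum((y - cy) ** 2 for _, y in points)
    mu11 = sum((x - cx) * (y - cy) for x, y in points)
    return math.degrees(0.5 * math.atan2(2.0 * mu11, mu20 - mu02))

def rotate_about(points, angle_degrees, center):
    """Rotate points by angle_degrees around center."""
    a = math.radians(angle_degrees)
    cos_a, sin_a = math.cos(a), math.sin(a)
    cx, cy = center
    return [
        (cx + (x - cx) * cos_a - (y - cy) * sin_a,
         cy + (x - cx) * sin_a + (y - cy) * cos_a)
        for x, y in points
    ]

def deskew(points):
    """Return (corrected points, estimated skew angle in degrees)."""
    angle = skew_angle_degrees(points)
    return rotate_about(points, -angle, centroid(points)), angle

# Synthetic "text line": 100 points on a line tilted by 10 degrees.
tilt = math.radians(10.0)
line = [(t * math.cos(tilt), t * math.sin(tilt)) for t in range(100)]
corrected, angle = deskew(line)
```

On this synthetic line the estimated angle should come out close to 10 degrees, and the corrected points should lie on a horizontal line. Real scanned pages would first need binarization to obtain the foreground pixels, which is outside this sketch.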
Similar resources
Ultra High Speed Approach for Document Skew Detection and Correction Based On Centre of Gravity
Skew detection and correction (SDC) has a direct effect on the efficiency and accuracy of document segmentation and analysis and is therefore considered a very important step in document analysis. Skew is a major problem in document analysis for every language. For Arabic/Persian scripts the problem is more severe because of the special features of these languages. In this paper a...
Document Analysis And Classification Based On Passing Window
In this paper we present a document analysis and classification system that segments and classifies the contents of Arabic document images. The system includes preprocessing, document segmentation, feature extraction, and document classification. In preprocessing, a document image is enhanced by removing noise, binarizing, and detecting and correcting image skew. In document segmentation, an algorith...
Skew Detection Technique for Various Scripts
This paper describes a technique for detecting the skew that is introduced when documents are scanned. It also discusses the tool used to implement the technique. The algorithm has been implemented on various scripts. The method provides a very efficient way to calculate the skew. Correction of the skewed scanned document image is very impor...
Skew Angle Estimation and Correction of Hand Written, Textual and Large areas of Non-textual Document Images: A Novel Approach
Skew angle estimation and correction of a document page is an important task for document analysis and optical character recognition (OCR) applications. Many skew detection approaches can process purely textual document images successfully, but it is challenging to process handwritten documents and documents with large non-textual areas. In this direction, a novel approach for textual...
Skew detection and correction in document images based on straight-line fitting
During document scanning, skew is inevitably introduced into the incoming document image. Since the algorithms for layout analysis and character recognition are generally very sensitive to the page skew, skew detection and correction in document images are the critical steps before layout analysis. In this paper, a novel skew detection method based on straight-line fitting is proposed. And a co...
Publication date: 2009